Search CORE

779 research outputs found

Application of compiler-assisted multiple instruction rollback recovery to speculative execution

Author: Alewine N. J.
Fuchs W. K.
Hwu W.-M.
Publication venue
Publication date
Field of study

Speculative execution is a method to increase instruction level parallelism which can be exploited by both super-scalar and VLIW architectures. The key to a successful general speculation strategy is a repair mechanism to handle mispredicted branches and accurate reporting of exceptions for speculated instructions. Multiple instruction rollback is a technique developed for recovery from transient processor failure. Many of the difficulties encountered during recovery from branch misprediction or from instruction re-execution due to exception in a speculative execution architecture are similar to those encountered during multiple instruction rollback. The applicability of a recently developed compiler-assisted multiple instruction rollback scheme to aid in speculative execution repair is investigated. Extensions to the compiler-assisted scheme to support branch and exception repair are presented along with performance measurements across ten application programs

NASA Technical Reports Server

Two-dimensional tetramer-cuprate Na5RbCu4(AsO4)4Cl2: phase transitions and AFMorder as seen by 87Rb NMR

Author: A. Kriisa
Clayhold
E. Joon
Hwu
I. Heinmaa
J. Clayhold
M. Ulutagay-Kartin
R. Stern
S. Vija
S.-J. Hwu
W. Queen
X. Mo
Publication venue: 'Elsevier BV'
Publication date: 30/12/2005
Field of study

We report the Rb nuclear magnetic resonance (NMR) results in a recently synthesized Na5RbCu4(AsO4)Cl2. This complex novel two-dimensional (2D) cuprate is an unique magnetic material, which contains layers of coupled Cu4O4 tetramers. In zero applied magnetic field, it orders antiferromagnetically via a second-order low-entropy phase transition at TN = 15(1) K. We characterise the ordered state by 87Rb NMR, and suggest for it a noncollinear rather than collinear arrangement of spins. We discuss the properties of Rb nuclear site and point out the new structural phase transition(s) around 74 K and 110 K.Comment: 2 pages, 2 figures, Proceedings of SCES'05, Vienna 200

arXiv.org e-Print Archive

Infoscience - École polytechnique fédérale de Lausanne

Crossref

Array concepts for solid-state and vacuum microelectronics millimeter-wave generation

Author: Hwu Ruey J.
Jou C. J.
Kim M.
Lam W. W.
Luhmann Neville C., Jr.
Popvić Zoya B.
Rutledge David B.
Publication venue
Publication date: 01/01/1989
Field of study

The authors have proposed that the increasing demand for contact watt-level coherent sources in the millimeter- and submillimeter-wave region can be satisfied by fabricating two-dimensional grids loaded with oscillators and multipliers for quasi-optical coherent spatial combining of the outputs of large numbers of low-power devices. This was first demonstrated through the successful fabrication of monolithic arrays with 2000 Schottky diodes. Watt-level power outputs were obtained in doubling to 66 GHz. In addition, a simple transmission-line model was verified with a quasi-optical reflectometer that measured the array impedance. This multiplier array work is being extended to novel tripler configurations using blocking barrier devices. The technique has also been extended to oscillator configurations where the grid structure is loaded with negative-resistance devices. This was first demonstrated using Gunn devices. More recently, a 25-element MESFET grid oscillating at 10 GHz exhibited power combining and self-locking. Currently, this approach is being extended to a 100-element monolithic array of Gunn diodes. This same approach should be applicable to planar vacuum electron devices such as the submillimeter-wave BWO (backward wave oscillator) and vacuum FET

CiteSeerX

Caltech Authors

Millimeter and submillimeter wave technology developments for the next generation of fusion devices

Author: Hwu R. J.
Kim M.
Luhmann N. C., Jr.
Popovic Z.
Rutledge D. B.
Sjogren L.
Weikel R. W.
Publication venue
Publication date: 01/10/1990
Field of study

There is increasing demand for compact watt-level coherent sources in the millimeter and submillimeter wave region. The approach that we have taken to satisfy this need is to fabricate two-dimensional grids loaded with oscillators, electronic beam steerers, and frequency multipliers for quasioptical coherent spatial combining of the outputs of a large number of low-power devices

Caltech Authors

Exposing errors related to weak memory in GPU applications

Author: Alastair F. Donaldson
Alcantara D. A. F.
Alglave J.
Alglave J.
Alglave J.
Bardsley E.
Chiang W.
Collier W. W.
Coplin J.
Feng W.
Hangal S.
Hwu W.-m. W.
Joshi S.
Lê N. M.
Sanders J.
Tyler Sorensen
Tzeng S.
Xiao S.
Yuki T.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 20/01/2016
Field of study

© 2016 ACM.We present the systematic design of a testing environment that uses stressing and fuzzing to reveal errors in GPU applications that arise due to weak memory effects. We evaluate our approach on seven GPUS spanning three NVIDIA architectures, across ten CUDA applications that use fine-grained concurrency. Our results show that applications that rarely or never exhibit errors related to weak memory when executed natively can readily exhibit these errors when executed in our testing environment. Our testing environment also provides a means to help identify the root causes of such errors, and automatically suggests how to insert fences that harden an application against weak memory bugs. To understand the cost of GPU fences, we benchmark applications with fences provided by the hardening strategy as well as a more conservative, sound fencing strategy

Crossref

Spiral - Imperial College Digital Repository

GPU Concurrency: Weak Behaviours and Programming Assumptions

Author: AMD.
AMD.
AMD.
Cederman D.
Cederman D.
Collier W.
Core ARM.
Feng W.
Hower D. R.
Hwu W.-m. W.
Khronos OpenCL Working Group
Sanders J.
Sorensen T.
Southern AMD.
Stuart J. A.
Weaver D. L.
Xiao S.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 10/11/2014
Field of study

Concurrency is pervasive and perplexing, particularly on graphics processing units (GPUs). Current specifications of languages and hardware are inconclusive; thus programmers often rely on folklore assumptions when writing software. To remedy this state of affairs, we conducted a large empirical study of the concurrent behaviour of deployed GPUs. Armed with litmus tests (i.e. short concurrent programs), we questioned the assumptions in programming guides and vendor documentation about the guarantees provided by hardware. We developed a tool to generate thousands of litmus tests and run them under stressful workloads. We observed a litany of previously elusive weak behaviours, and exposed folklore beliefs about GPU programming---often supported by official tutorials---as false. As a way forward, we propose a model of Nvidia GPU hardware, which correctly models every behaviour witnessed in our experiments. The model is a variant of SPARC Relaxed Memory Order (RMO), structured following the GPU concurrency hierarchy

Crossref

Oxford University Research Archive

Kent Academic Repository

Spiral - Imperial College Digital Repository

Application of Compiler-Assisted Multiple Instruction Rollback Recovery to Speculative Execution

Author: Alewine N.J.
Fuchs W.K.
Hwu W.-M.
Publication venue: Center for Reliable and High-Performance Computing, Coordinated Science Laboratory, University of Illinois at Urbana-Champaign
Publication date: 01/07/1993
Field of study

Coordinated Science Laboratory was formerly known as Control Systems LaboratoryNational Aeronautics and Space Administration / NASA NAG 1-613Department of the Navy managed by the Office of the Chief of Naval Research / N00014-91-J-128

Illinois Digital Environment for Access to Learning and Scholarship Repository

Single-Pass Memory System Evaluation for Multiprogramming Workloads

Author: Conte Thomas M.
Hwu Wen-mei W.
Publication venue: Coordinated Science Laboratory, University of Illinois at Urbana-Champaign
Publication date: 01/05/1990
Field of study

Coordinated Science Laboratory was formerly known as Control Systems LaboratoryNational Science Foundation (NSF) / MIP-8809478NCRNational Aeronautics and Space Administration (NASA) / NASA NAG 1-613Office of Naval Research / N00014-88-K-065

Illinois Digital Environment for Access to Learning and Scholarship Repository

NASA Technical Reports Server

The Susceptibility of Programs to Context Switching

Author: Conte Thomas M.
Hwu Wen-mei W.
Publication venue: Center for Reliable and High-Performance Computing, Coordinated Science Laboratory, University of Illinois at Urbana-Champaign
Publication date: 01/04/1991
Field of study

Coordinated Science Laboratory was formerly known as Control Systems LaboratoryNational Science Foundation / MIP-8809478NCR Corp.AMD Corp. 29K Advanced Processor Development DivisionNational Aeronautics and Space Administration / NASA NAG 1-613Office of Naval Research / N00014-88-K-0656Hewlett-Packard Co

Illinois Digital Environment for Access to Learning and Scholarship Repository